Sudong Wang

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

May 21, 2026
Via arXiv

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

May 11, 2026
Via arXiv

Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems

Mar 23, 2026
Via arXiv

Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling

Feb 03, 2026
Via arXiv

Resource-Efficient Reinforcement for Reasoning Large Language Models via Dynamic One-Shot Policy Refinement

Jan 31, 2026
Via arXiv

AMA: Adaptive Memory via Multi-Agent Collaboration

Jan 28, 2026
Via arXiv